Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 1465783 |
| Missing cells | 1896895 |
| Missing cells (%) | 7.2% |
| Duplicate rows | 15832 |
| Duplicate rows (%) | 1.1% |
| Total size in memory | 1.4 GiB |
| Average record size in memory | 994.1 B |
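The percentages and the total size in the table above can be re-derived from the raw counts. A minimal sketch of that arithmetic, using only figures from the table (the variable names are illustrative, and GiB is assumed to mean 2^30 bytes):

```python
# Figures copied from the "Dataset statistics" table.
n_vars = 18
n_obs = 1_465_783
missing_cells = 1_896_895
duplicate_rows = 15_832
avg_record_bytes = 994.1

# Missing cells are measured against every cell (rows x columns).
total_cells = n_vars * n_obs
missing_pct = 100 * missing_cells / total_cells

# Duplicate rows are measured against the number of observations.
duplicate_pct = 100 * duplicate_rows / n_obs

# Total size is roughly the average record size times the row count.
total_gib = avg_record_bytes * n_obs / 2**30

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Total size: {total_gib:.1f} GiB")
```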
Variable types
| Categorical | 7 |
|---|---|
| Text | 8 |
| Numeric | 3 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 15832 (1.1%) duplicate rows | Duplicates |
| PresentState is highly imbalanced (95.2%) | Imbalance |
| PermanentAddress is highly imbalanced (> 99.9%) | Imbalance |
| PermanentState is highly imbalanced (95.2%) | Imbalance |
| InjuryType has 456136 (31.1%) missing values | Missing |
| Injury_Nature has 1438422 (98.1%) missing values | Missing |
| age has 70740 (4.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-14 05:57:28.324118 |
|---|---|
| Analysis finished | 2024-04-14 06:00:25.380735 |
| Duration | 2 minutes and 57.06 seconds |
| Software version | ydata-profiling v4.7.0 |
| Download configuration | config.json |
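A report of this kind could be regenerated roughly as follows. This is a sketch only: the input path, output path, and report title are hypothetical, and the original run's `config.json` settings are not reproduced here. It assumes `pandas` and `ydata-profiling` are installed.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this module loads even without the dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(out_path)
```

Calling `build_report("data.csv")` would then write `report.html` next to the script, with `data.csv` standing in for the real source file.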
District_Name
Categorical
| Distinct | 41 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 94.7 MiB |
| Value | Count |
|---|---|
| Bengaluru City | 293958 |
| Tumakuru | 66879 |
| Hassan | 64399 |
| Belagavi Dist | 60516 |
| Bengaluru Dist | 60000 |
| Other values (36) | 920031 |
Length
| Max length | 23 |
|---|---|
| Median length | 18 |
| Mean length | 10.744256 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15748748 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bagalkot |
|---|---|
| 2nd row | Bagalkot |
| 3rd row | Bagalkot |
| 4th row | Bagalkot |
| 5th row | Bagalkot |
Common Values
| Value | Count | Frequency (%) |
| Bengaluru City | 293958 | 20.1% |
| Tumakuru | 66879 | 4.6% |
| Hassan | 64399 | 4.4% |
| Belagavi Dist | 60516 | 4.1% |
| Bengaluru Dist | 60000 | 4.1% |
| Shivamogga | 58342 | 4.0% |
| Mandya | 54474 | 3.7% |
| Chitradurga | 46664 | 3.2% |
| Mysuru Dist | 45672 | 3.1% |
| Davanagere | 41943 | 2.9% |
| Other values (31) | 672936 | 45.9% |
Words
| Value | Count | Frequency (%) |
| city | 406583 | |
| bengaluru | 353959 | |
| dist | 166188 | 7.8% |
| belagavi | 80061 | 3.8% |
| mysuru | 76206 | 3.6% |
| tumakuru | 66879 | 3.2% |
| hassan | 64399 | 3.0% |
| shivamogga | 58342 | 2.8% |
| kannada | 54478 | 2.6% |
| mandya | 54474 | 2.6% |
| Other values (33) | 738218 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2745052 | |
| u | 1428766 | 9.1% |
| r | 1138016 | 7.2% |
| i | 1131101 | 7.2% |
| g | 907428 | 5.8% |
| l | 764968 | 4.9% |
| n | 761944 | 4.8% |
| t | 718336 | 4.6% |
| (space) | 654004 | 4.2% |
| y | 597325 | 3.8% |
| Other values (30) | 4901808 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15748748 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15748748 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15748748 |
UnitName
Text
| Distinct | 1044 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 100.1 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 30 |
| Mean length | 14.637915 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21456007 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Amengad PS |
|---|---|
| 2nd row | Amengad PS |
| 3rd row | Amengad PS |
| 4th row | Amengad PS |
| 5th row | Amengad PS |
| Value | Count | Frequency (%) |
| ps | 1459100 | |
| rural | 157282 | 4.3% |
| traffic | 131192 | 3.6% |
| town | 81321 | 2.2% |
| crime | 51770 | 1.4% |
| cen | 50795 | 1.4% |
| nagar | 39496 | 1.1% |
| women | 24072 | 0.7% |
| south | 17104 | 0.5% |
| layout | 15292 | 0.4% |
| Other values (823) | 1618665 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3262872 | |
| (space) | 2211315 | 10.3% |
| S | 1637267 | 7.6% |
| P | 1521395 | 7.1% |
| r | 1390312 | 6.5% |
| i | 1008860 | 4.7% |
| l | 910718 | 4.2% |
| n | 887906 | 4.1% |
| u | 865220 | 4.0% |
| e | 682382 | 3.2% |
| Other values (44) | 7077760 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21456007 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21456007 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21456007 |
Year
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2019.913 |
| Minimum | 2016 |
|---|---|
| Maximum | 2024 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.2 MiB |
Quantile statistics
| Minimum | 2016 |
|---|---|
| 5-th percentile | 2016 |
| Q1 | 2018 |
| median | 2020 |
| Q3 | 2022 |
| 95-th percentile | 2023 |
| Maximum | 2024 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.3788086 |
|---|---|
| Coefficient of variation (CV) | 0.0011776787 |
| Kurtosis | -1.2089886 |
| Mean | 2019.913 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.10328035 |
| Sum | 2.9607542 × 10^9 |
| Variance | 5.6587306 |
| Monotonicity | Not monotonic |
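The descriptive statistics above can be cross-checked against the per-year counts listed for this variable. The sketch below recomputes the sum, mean, variance, standard deviation, and coefficient of variation from those counts. It assumes the report uses the sample variance (n - 1 in the denominator), which is the pandas default; that is an assumption about the tool, not something the report states.

```python
import math

# Year -> number of rows, copied from the value tables for Year.
year_counts = {
    2016: 140_729, 2017: 162_718, 2018: 170_560, 2019: 174_211,
    2020: 166_943, 2021: 177_218, 2022: 202_292, 2023: 227_315,
    2024: 43_797,
}

n = sum(year_counts.values())
total = sum(year * count for year, count in year_counts.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(
    count * (year - mean) ** 2 for year, count in year_counts.items()
) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.3f}")
print(f"variance={variance:.4f} std={std:.4f} cv={cv:.7f}")
```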
Common Values
| Value | Count | Frequency (%) |
| 2023 | 227315 | 15.5% |
| 2022 | 202292 | 13.8% |
| 2021 | 177218 | 12.1% |
| 2019 | 174211 | 11.9% |
| 2018 | 170560 | 11.6% |
| 2020 | 166943 | 11.4% |
| 2017 | 162718 | 11.1% |
| 2016 | 140729 | 9.6% |
| 2024 | 43797 | 3.0% |
Extreme values (minimum)
| Value | Count | Frequency (%) |
| 2016 | 140729 | 9.6% |
| 2017 | 162718 | 11.1% |
| 2018 | 170560 | 11.6% |
| 2019 | 174211 | 11.9% |
| 2020 | 166943 | 11.4% |
| 2021 | 177218 | 12.1% |
| 2022 | 202292 | 13.8% |
| 2023 | 227315 | 15.5% |
| 2024 | 43797 | 3.0% |
Extreme values (maximum)
| Value | Count | Frequency (%) |
| 2024 | 43797 | 3.0% |
| 2023 | 227315 | 15.5% |
| 2022 | 202292 | 13.8% |
| 2021 | 177218 | 12.1% |
| 2020 | 166943 | 11.4% |
| 2019 | 174211 | 11.9% |
| 2018 | 170560 | 11.6% |
| 2017 | 162718 | 11.1% |
| 2016 | 140729 | 9.6% |
Month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.3513944 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.5109914 |
|---|---|
| Coefficient of variation (CV) | 0.55279065 |
| Kurtosis | -1.2476962 |
| Mean | 6.3513944 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.055712088 |
| Sum | 9309766 |
| Variance | 12.327061 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 137944 | 9.4% |
| 2 | 134529 | 9.2% |
| 3 | 131974 | 9.0% |
| 12 | 124326 | 8.5% |
| 5 | 122591 | 8.4% |
| 6 | 120906 | 8.2% |
| 10 | 118128 | 8.1% |
| 11 | 116501 | 7.9% |
| 8 | 115438 | 7.9% |
| 7 | 115325 | 7.9% |
| Other values (2) | 228121 | 15.6% |
Extreme values (minimum)
| Value | Count | Frequency (%) |
| 1 | 137944 | 9.4% |
| 2 | 134529 | 9.2% |
| 3 | 131974 | 9.0% |
| 4 | 114024 | 7.8% |
| 5 | 122591 | 8.4% |
| 6 | 120906 | 8.2% |
| 7 | 115325 | 7.9% |
| 8 | 115438 | 7.9% |
| 9 | 114097 | 7.8% |
| 10 | 118128 | 8.1% |
Extreme values (maximum)
| Value | Count | Frequency (%) |
| 12 | 124326 | 8.5% |
| 11 | 116501 | 7.9% |
| 10 | 118128 | 8.1% |
| 9 | 114097 | 7.8% |
| 8 | 115438 | 7.9% |
| 7 | 115325 | 7.9% |
| 6 | 120906 | 8.2% |
| 5 | 122591 | 8.4% |
| 4 | 114024 | 7.8% |
| 3 | 131974 | 9.0% |
age
Real number (ℝ)
ZEROS 
| Distinct | 113 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.333776 |
| Minimum | -19 |
|---|---|
| Maximum | 665 |
| Zeros | 70740 |
| Zeros (%) | 4.8% |
| Negative | 14 |
| Negative (%) | < 0.1% |
| Memory size | 11.2 MiB |
Quantile statistics
| Minimum | -19 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 24 |
| median | 33 |
| Q3 | 45 |
| 95-th percentile | 64 |
| Maximum | 665 |
| Range | 684 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 16.607518 |
|---|---|
| Coefficient of variation (CV) | 0.48370789 |
| Kurtosis | 1.6627866 |
| Mean | 34.333776 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.22427884 |
| Sum | 50325865 |
| Variance | 275.80966 |
| Monotonicity | Not monotonic |
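The statistics above include 70740 zeros, 14 negative values, and a maximum of 665, none of which can be literal ages. One way to separate such records before analysis is sketched below. The `classify_age` helper and the upper bound of 120 are illustrative assumptions, not part of the report or the dataset's documentation.

```python
def classify_age(age: int, max_plausible: int = 120) -> str:
    """Label an age value; the 120-year ceiling is an assumed cutoff."""
    if age < 0:
        return "negative"
    if age == 0:
        return "zero"  # possibly a stand-in for "unknown"
    if age > max_plausible:
        return "implausible"
    return "plausible"


# Values taken from the minimum, median, and maximum rows of the report.
for value in (-19, 0, 33, 665):
    print(value, classify_age(value))
```

Whether zero should be treated as missing rather than as an infant's age depends on how the source system records unknown ages, which the report does not say.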
Common Values
| Value | Count | Frequency (%) |
| 0 | 70740 | 4.8% |
| 35 | 64017 | 4.4% |
| 30 | 61242 | 4.2% |
| 45 | 56173 | 3.8% |
| 40 | 54790 | 3.7% |
| 25 | 48681 | 3.3% |
| 28 | 48162 | 3.3% |
| 32 | 43527 | 3.0% |
| 38 | 38821 | 2.6% |
| 50 | 38590 | 2.6% |
| Other values (103) | 941040 | 64.2% |
Extreme values (minimum)
| Value | Count | Frequency (%) |
| -19 | 1 | < 0.1% |
| -18 | 13 | < 0.1% |
| 0 | 70740 | 4.8% |
| 1 | 2944 | 0.2% |
| 2 | 2543 | 0.2% |
| 3 | 3129 | 0.2% |
| 4 | 3591 | 0.2% |
| 5 | 3525 | 0.2% |
| 6 | 3644 | 0.2% |
| 7 | 3207 | 0.2% |
Extreme values (maximum)
| Value | Count | Frequency (%) |
| 665 | 1 | < 0.1% |
| 448 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 110 | 2 | < 0.1% |
| 109 | 1 | < 0.1% |
| 105 | 3 | < 0.1% |
| 104 | 1 | < 0.1% |
| 103 | 3 | < 0.1% |
| 102 | 8 | < 0.1% |
| 101 | 3 | < 0.1% |
Caste
Text
| Distinct | 989 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.2 MiB |
Length
| Max length | 52 |
|---|---|
| Median length | 43 |
| Mean length | 8.2739028 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12127746 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lingayath |
|---|---|
| 2nd row | VOKKALIGA |
| 3rd row | VOKKALIGA |
| 4th row | VOKKALIGA |
| 5th row | VOKKALIGA |
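The sample rows mix capitalisation styles ("Lingayath" next to "VOKKALIGA"), so the same label may be counted under several spellings. A minimal normalisation sketch follows. The `normalize_label` helper is hypothetical, and it only folds case and whitespace; it does not merge genuine spelling variants.

```python
from collections import Counter


def normalize_label(label: str) -> str:
    """Collapse runs of whitespace, trim the ends, and fold case."""
    return " ".join(label.split()).casefold()


# The five sample rows shown above.
sample = ["Lingayath", "VOKKALIGA", "VOKKALIGA", "VOKKALIGA", "VOKKALIGA"]
counts = Counter(normalize_label(value) for value in sample)
print(counts.most_common())
```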
| Value | Count | Frequency (%) |
| vokkaliga | 335944 | |
| lingayath | 143937 | 8.5% |
| muslim | 137765 | 8.1% |
| adi | 111940 | 6.6% |
| karnataka | 93169 | 5.5% |
| kuruba | 44856 | 2.6% |
| achari | 41325 | 2.4% |
| nayaka | 38365 | 2.3% |
| lambani | 31739 | 1.9% |
| brahmin | 27141 | 1.6% |
| Other values (1069) | 687604 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2497215 | |
| K | 1079867 | 8.9% |
| I | 1027586 | 8.5% |
| L | 855437 | 7.1% |
| R | 524018 | 4.3% |
| G | 518439 | 4.3% |
| O | 495271 | 4.1% |
| M | 481889 | 4.0% |
| V | 479613 | 4.0% |
| U | 365712 | 3.0% |
| Other values (45) | 3802699 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12127746 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12127746 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12127746 |
Profession
Text
| Distinct | 190 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 94.7 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 27 |
| Mean length | 10.714786 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15705551 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 22 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Farmer |
|---|---|
| 2nd row | Farmer |
| 3rd row | Farmer |
| 4th row | Farmer |
| 5th row | Farmer |
| Value | Count | Frequency (%) |
| farmer | 385365 | |
| labourer | 198976 | 9.5% |
| housewife | 177718 | 8.5% |
| others | 163045 | 7.8% |
| pi | 150246 | 7.1% |
| specify | 150246 | 7.1% |
| student | 118573 | 5.6% |
| businessman | 65816 | 3.1% |
| officer | 52608 | 2.5% |
| police | 51682 | 2.5% |
| Other values (226) | 588355 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2023553 | 12.9% |
| (space) | 1829878 | 11.7% |
| r | 1795060 | 11.4% |
| a | 878460 | 5.6% |
| i | 805060 | 5.1% |
| o | 733922 | 4.7% |
| s | 650939 | 4.1% |
| u | 618190 | 3.9% |
| t | 616093 | 3.9% |
| m | 570605 | 3.6% |
| Other values (46) | 5183791 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15705551 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15705551 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15705551 |
Sex
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 227 |
| Missing (%) | < 0.1% |
| Memory size | 86.2 MiB |
| Value | Count |
|---|---|
| MALE | 995123 |
| FEMALE | 469964 |
| Enuch | 469 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.6416657 |
| Min length | 4 |
Characters and Unicode
| Total characters | 6802621 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FEMALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | MALE |
Common Values
| Value | Count | Frequency (%) |
| MALE | 995123 | 67.9% |
| FEMALE | 469964 | 32.1% |
| Enuch | 469 | < 0.1% |
| (Missing) | 227 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| male | 995123 | |
| female | 469964 | |
| enuch | 469 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1935520 | |
| M | 1465087 | |
| A | 1465087 | |
| L | 1465087 | |
| F | 469964 | 6.9% |
| n | 469 | < 0.1% |
| u | 469 | < 0.1% |
| c | 469 | < 0.1% |
| h | 469 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6802621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6802621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6802621 |
PresentAddress
Text
| Distinct | 1034708 |
|---|---|
| Distinct (%) | 70.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 134.6 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 80 |
| Mean length | 39.296896 |
| Min length | 1 |
Characters and Unicode
| Total characters | 57600722 |
|---|---|
| Distinct characters | 111 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 885717 |
|---|---|
| Unique (%) | 60.4% |
Sample
| 1st row | HUVINAHALLI,TQ-HUANGUND |
|---|---|
| 2nd row | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,GOKAK |
| 3rd row | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,GOKAK |
| 4th row | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,TQ-GOKAK |
| 5th row | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,TQ-GOKAK |
| Value | Count | Frequency (%) |
| tq | 259100 | 3.8% |
| taluk | 171865 | 2.5% |
| village | 133494 | 2.0% |
| cross | 111413 | 1.7% |
| no | 110460 | 1.6% |
| r/o | 84642 | 1.3% |
| (empty) | 83782 | 1.2% |
| town | 81032 | 1.2% |
| main | 79676 | 1.2% |
| road | 75665 | 1.1% |
| Other values (735756) | 5554425 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5467165 | 9.5% |
| (space) | 5381078 | 9.3% |
| A | 3484201 | 6.0% |
| , | 2404683 | 4.2% |
| l | 2350038 | 4.1% |
| i | 1936167 | 3.4% |
| r | 1714304 | 3.0% |
| N | 1536076 | 2.7% |
| T | 1521154 | 2.6% |
| e | 1518070 | 2.6% |
| Other values (101) | 30287786 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 57600722 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 57600722 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 57600722 |
PresentCity
Text
| Distinct | 693 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 105 |
| Missing (%) | < 0.1% |
| Memory size | 94.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 23 |
| Mean length | 10.614335 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15557198 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 52 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bagalkot |
|---|---|
| 2nd row | Bagalkot |
| 3rd row | Bagalkot |
| 4th row | Belagavi Dist |
| 5th row | Belagavi Dist |
| Value | Count | Frequency (%) |
| city | 410429 | |
| bengaluru | 360518 | |
| dist | 159706 | 7.6% |
| belagavi | 78240 | 3.7% |
| mysuru | 75604 | 3.6% |
| hassan | 63267 | 3.0% |
| tumakuru | 62450 | 3.0% |
| shivamogga | 58104 | 2.8% |
| mandya | 52865 | 2.5% |
| kannada | 50500 | 2.4% |
| Other values (683) | 738144 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2618344 | |
| u | 1430994 | 9.2% |
| r | 1136119 | 7.3% |
| i | 1124739 | 7.2% |
| g | 890011 | 5.7% |
| l | 792070 | 5.1% |
| n | 747638 | 4.8% |
| t | 709241 | 4.6% |
| (space) | 644195 | 4.1% |
| y | 579936 | 3.7% |
| Other values (48) | 4883911 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15557198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15557198 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15557198 |
PresentState
Categorical
IMBALANCE 
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 948 |
| Missing (%) | 0.1% |
| Memory size | 92.3 MiB |
| Value | Count |
|---|---|
| Karnataka | 1428929 |
| Maharashtra | 8441 |
| Andhra pradesh | 8063 |
| Tamilnadu | 5099 |
| Kerala | 3220 |
| Other values (33) | 11083 |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.0309359 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13228831 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Karnataka |
|---|---|
| 2nd row | Karnataka |
| 3rd row | Karnataka |
| 4th row | Karnataka |
| 5th row | Karnataka |
Common Values
| Value | Count | Frequency (%) |
| Karnataka | 1428929 | 97.5% |
| Maharashtra | 8441 | 0.6% |
| Andhra pradesh | 8063 | 0.6% |
| Tamilnadu | 5099 | 0.3% |
| Kerala | 3220 | 0.2% |
| Telangana | 1757 | 0.1% |
| Uttar pradesh | 1435 | 0.1% |
| West bengal | 1303 | 0.1% |
| Bihar | 1263 | 0.1% |
| Rajasthan | 712 | < 0.1% |
| Other values (28) | 4613 | 0.3% |
| (Missing) | 948 | 0.1% |
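The 95.2% imbalance figure reported for this variable is not the share of the top value (Karnataka accounts for about 97.5% of rows). It appears to be an entropy-based score, one minus the Shannon entropy of the value counts divided by its maximum for the given number of classes. The sketch below applies that formula to the counts above. Treat the formula as an assumption about how the tool computes the score; the even split of "Other values (28)" is also an assumption, since the report does not list those 28 counts.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits."""
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(len(counts))


# The ten listed states, from the Common Values table.
listed = [1_428_929, 8_441, 8_063, 5_099, 3_220, 1_757, 1_435, 1_303, 1_263, 712]

# "Other values (28)" totals 4613 rows. Spreading them evenly maximises the
# entropy, so the result is a lower bound on the score.
other = [4_613 / 28] * 28

score = imbalance_score(listed + other)
print(f"Imbalance score (lower bound): {score:.1%}")
```

A perfectly balanced variable scores 0 and a variable with a single dominant value approaches 1, which is why both state columns are flagged.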
Words
| Value | Count | Frequency (%) |
| karnataka | 1428929 | |
| pradesh | 10136 | 0.7% |
| maharashtra | 8441 | 0.6% |
| andhra | 8063 | 0.5% |
| tamilnadu | 5099 | 0.3% |
| kerala | 3220 | 0.2% |
| telangana | 1757 | 0.1% |
| uttar | 1435 | 0.1% |
| west | 1303 | 0.1% |
| bengal | 1303 | 0.1% |
| Other values (38) | 6868 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5802330 | |
| r | 1472350 | 11.1% |
| n | 1448958 | 11.0% |
| t | 1443416 | 10.9% |
| K | 1432149 | 10.8% |
| k | 1429464 | 10.8% |
| h | 40015 | 0.3% |
| d | 24458 | 0.2% |
| s | 22951 | 0.2% |
| e | 18173 | 0.1% |
| Other values (35) | 94567 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13228831 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13228831 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13228831 |
PermanentAddress
Categorical
IMBALANCE 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 81.1 MiB |
| Value | Count |
|---|---|
| , | 1465770 |
| CHINCHALAKATTI LT,KERUR TQ: BADAMI | 1 |
| Rakam Karnali Garama Dehakh Dist Behari Zone,NEpal | 1 |
| na,na | 1 |
| No,No | 1 |
| Other values (9) | 9 |
Length
| Max length | 70 |
|---|---|
| Median length | 1 |
| Mean length | 1.000307 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1466233 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | , |
|---|---|
| 2nd row | , |
| 3rd row | , |
| 4th row | , |
| 5th row | , |
Common Values
| Value | Count | Frequency (%) |
| , | 1465770 | > 99.9% |
| CHINCHALAKATTI LT,KERUR TQ: BADAMI | 1 | < 0.1% |
| Rakam Karnali Garama Dehakh Dist Behari Zone,NEpal | 1 | < 0.1% |
| na,na | 1 | < 0.1% |
| No,No | 1 | < 0.1% |
| Kailori village, Ronija post,Nadavai taluk | 1 | < 0.1% |
| NAGALYANDA,NAGALYANDA | 1 | < 0.1% |
| Naduvil Villlage, Podomadattil Mandala Post,,Kannur | 1 | < 0.1% |
| KITHANUR VILLAGE BIDARAHALLI(H),BANGALORE EAST TQ | 1 | < 0.1% |
| HANUR(V) KOLLEGALA ,CH N | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
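Almost every row holds the bare separator ",", and values such as "na,na" and "No,No" are placeholders too, so the column carries almost no real addresses even though the report counts 0 missing values. A sketch of a check that treats such values as missing follows. The `is_placeholder_address` helper and its word list are illustrative assumptions, not part of the report.

```python
# Tokens treated as "no information"; this list is an assumption.
PLACEHOLDER_PARTS = {"", "na", "no", "n/a", "nil", "none"}


def is_placeholder_address(value: str) -> bool:
    """True when every comma-separated part is empty or a placeholder word."""
    parts = [part.strip().casefold() for part in value.split(",")]
    return all(part in PLACEHOLDER_PARTS for part in parts)


for value in [",", "na,na", "No,No", "NAGALYANDA,NAGALYANDA"]:
    print(repr(value), is_placeholder_address(value))
```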
Words
| Value | Count | Frequency (%) |
| (empty) | 1465770 | > 99.9% |
| tq | 3 | < 0.1% |
| nagara | 2 | < 0.1% |
| village | 2 | < 0.1% |
| thota | 1 | < 0.1% |
| hanur(v | 1 | < 0.1% |
| kollegala | 1 | < 0.1% |
| ch | 1 | < 0.1% |
| n | 1 | < 0.1% |
| mudigere,mudigere | 1 | < 0.1% |
| Other values (40) | 40 | < 0.1% |
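The tokens in the table above (for example "hanur(v", "ch", "n", and one blank token per "," row) are consistent with lower-casing each value, splitting on whitespace, and trimming punctuation from both ends of each piece. The sketch below reproduces that behaviour. It is a guess at the tokenisation that happens to match these rows, not a statement of how the profiling tool is implemented.

```python
import string
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lower-case, split on whitespace, strip punctuation from token edges."""
    return [word.strip(string.punctuation) for word in text.lower().split()]


words = Counter()
for address in [",", "HANUR(V) KOLLEGALA ,CH N"]:
    words.update(tokenize(address))

print(words)
```

Under this scheme a value of "," becomes a single empty token, which would explain the blank row with 1465770 occurrences.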
Most occurring characters
| Value | Count | Frequency (%) |
| , | 1465789 | |
| a | 49 | < 0.1% |
| (space) | 40 | < 0.1% |
| A | 27 | < 0.1% |
| N | 17 | < 0.1% |
| l | 17 | < 0.1% |
| i | 16 | < 0.1% |
| o | 16 | < 0.1% |
| r | 15 | < 0.1% |
| n | 15 | < 0.1% |
| Other values (50) | 232 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1466233 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1466233 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1466233 |
PermanentCity
Text
| Distinct | 693 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 105 |
| Missing (%) | < 0.1% |
| Memory size | 94.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 23 |
| Mean length | 10.614335 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15557198 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 52 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bagalkot |
|---|---|
| 2nd row | Bagalkot |
| 3rd row | Bagalkot |
| 4th row | Belagavi Dist |
| 5th row | Belagavi Dist |
| Value | Count | Frequency (%) |
| city | 410429 | |
| bengaluru | 360518 | |
| dist | 159706 | 7.6% |
| belagavi | 78240 | 3.7% |
| mysuru | 75604 | 3.6% |
| hassan | 63267 | 3.0% |
| tumakuru | 62450 | 3.0% |
| shivamogga | 58104 | 2.8% |
| mandya | 52865 | 2.5% |
| kannada | 50500 | 2.4% |
| Other values (683) | 738144 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2618344 | |
| u | 1430994 | 9.2% |
| r | 1136119 | 7.3% |
| i | 1124739 | 7.2% |
| g | 890011 | 5.7% |
| l | 792070 | 5.1% |
| n | 747638 | 4.8% |
| t | 709241 | 4.6% |
| (space) | 644195 | 4.1% |
| y | 579936 | 3.7% |
| Other values (48) | 4883911 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15557198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15557198 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15557198 |
PermanentState
Categorical
IMBALANCE 
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 948 |
| Missing (%) | 0.1% |
| Memory size | 92.3 MiB |
| Value | Count |
|---|---|
| Karnataka | 1428929 |
| Maharashtra | 8441 |
| Andhra pradesh | 8063 |
| Tamilnadu | 5099 |
| Kerala | 3220 |
| Other values (33) | 11083 |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.0309359 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13228831 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Karnataka |
|---|---|
| 2nd row | Karnataka |
| 3rd row | Karnataka |
| 4th row | Karnataka |
| 5th row | Karnataka |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Karnataka | 1428929 | 97.5% |
| Maharashtra | 8441 | 0.6% |
| Andhra pradesh | 8063 | 0.6% |
| Tamilnadu | 5099 | 0.3% |
| Kerala | 3220 | 0.2% |
| Telangana | 1757 | 0.1% |
| Uttar pradesh | 1435 | 0.1% |
| West bengal | 1303 | 0.1% |
| Bihar | 1263 | 0.1% |
| Rajasthan | 712 | < 0.1% |
| Other values (28) | 4613 | 0.3% |
| (Missing) | 948 | 0.1% |
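The IMBALANCE flag on PermanentState reports 95.2%, while Karnataka alone accounts for well over 90% of the rows. A plausible reading, assumed here rather than stated in the report, is that the score is one minus the normalized Shannon entropy of the class counts. The sketch below shows that calculation on toy counts; `imbalance_score` is a hypothetical helper name, and the exact 95.2% cannot be reproduced from the table above because 28 of the 38 values are lumped into "Other values".

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class counts and k is the number of classes.

    0.0 means perfectly balanced; values near 1.0 mean one class dominates.
    This formula is an assumption about how the alert could be derived.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Toy distributions, not the real column.
balanced = imbalance_score([250, 250, 250, 250])
skewed = imbalance_score([970, 10, 10, 10])
print(round(balanced, 3), round(skewed, 3))
```

To score the real column, the full per-state counts (all 38 values) would have to be passed in, not just the ten shown here.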
Words
| Value | Count | Frequency (%) |
|---|---|---|
| karnataka | 1428929 | 96.8% |
| pradesh | 10136 | 0.7% |
| maharashtra | 8441 | 0.6% |
| andhra | 8063 | 0.5% |
| tamilnadu | 5099 | 0.3% |
| kerala | 3220 | 0.2% |
| telangana | 1757 | 0.1% |
| uttar | 1435 | 0.1% |
| west | 1303 | 0.1% |
| bengal | 1303 | 0.1% |
| Other values (38) | 6868 | 0.5% |
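The word table above looks like the result of lower-casing each value and splitting it on whitespace, so that "Andhra pradesh" and "Uttar pradesh" both feed the `pradesh` row. The following is a minimal sketch of that idea using only a handful of values from Common Values; the splitting rule is an assumption, and `word_counts` is a hypothetical helper.

```python
from collections import Counter


def word_counts(value_counts):
    """Aggregate per-value counts into per-word counts.

    Each value is lower-cased and split on whitespace; every resulting
    word inherits the count of the value it came from.
    """
    words = Counter()
    for value, n in value_counts.items():
        for word in value.lower().split():
            words[word] += n
    return words


# A subset of the Common Values table; the real column has 38 values.
common_values = {
    "Karnataka": 1428929,
    "Maharashtra": 8441,
    "Andhra pradesh": 8063,
    "Uttar pradesh": 1435,
    "West bengal": 1303,
}
counts = word_counts(common_values)
print(counts.most_common(3))
```

With only these five values the `pradesh` total comes to 9498; the report's figure of 10136 would additionally include values that fall under "Other values".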
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 5802330 | 43.9% |
| r | 1472350 | 11.1% |
| n | 1448958 | 11.0% |
| t | 1443416 | 10.9% |
| K | 1432149 | 10.8% |
| k | 1429464 | 10.8% |
| h | 40015 | 0.3% |
| d | 24458 | 0.2% |
| s | 22951 | 0.2% |
| e | 18173 | 0.1% |
| Other values (35) | 94567 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 13228831 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 13228831 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 13228831 | 100.0% |
Nationality_Name
Text
| Distinct | 88 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 86.7 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 5.0020139 |
| Min length | 4 |
Characters and Unicode
| Total characters | 7331847 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 29 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | India |
|---|---|
| 2nd row | India |
| 3rd row | India |
| 4th row | India |
| 5th row | India |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| india | 1464637 | 99.9% |
| indonesia | 238 | < 0.1% |
| nepal | 192 | < 0.1% |
| iran | 84 | < 0.1% |
| haiti | 70 | < 0.1% |
| bangladesh | 68 | < 0.1% |
| thailand | 49 | < 0.1% |
| macedonia | 36 | < 0.1% |
| united | 33 | < 0.1% |
| uganda | 30 | < 0.1% |
| Other values (91) | 480 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 1466034 | 20.0% |
| n | 1465655 | 20.0% |
| i | 1465362 | 20.0% |
| d | 1465191 | 20.0% |
| I | 1465027 | 20.0% |
| e | 818 | < 0.1% |
| s | 420 | < 0.1% |
| l | 406 | < 0.1% |
| o | 374 | < 0.1% |
| r | 290 | < 0.1% |
| Other values (41) | 2270 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7331847 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7331847 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7331847 | 100.0% |
PersonType
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.7 MiB |
| Value | Count |
|---|---|
| Injured | 664146 |
| complainnant | 453219 |
| Missing | 147206 |
| Deceased | 109167 |
| Others | 56198 |
| Other values (6) | 35847 |
Length
| Max length | 22 |
|---|---|
| Median length | 7 |
| Mean length | 8.6119917 |
| Min length | 4 |
Characters and Unicode
| Total characters | 12623311 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Deceased |
|---|---|
| 2nd row | Injured |
| 3rd row | Injured |
| 4th row | Injured |
| 5th row | Injured |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Injured | 664146 | 45.3% |
| complainnant | 453219 | 30.9% |
| Missing | 147206 | 10.0% |
| Deceased | 109167 | 7.4% |
| Others | 56198 | 3.8% |
| Kidnapped | 24187 | 1.7% |
| Rape | 9788 | 0.7% |
| Unidentified Dead Body | 1121 | 0.1% |
| Unidentified Person | 667 | < 0.1% |
| Arrest | 77 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| injured | 664146 | 45.2% |
| complainnant | 453219 | 30.9% |
| missing | 147206 | 10.0% |
| deceased | 109167 | 7.4% |
| others | 56198 | 3.8% |
| kidnapped | 24187 | 1.6% |
| rape | 9788 | 0.7% |
| unidentified | 1788 | 0.1% |
| dead | 1121 | 0.1% |
| body | 1121 | 0.1% |
| Other values (3) | 751 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| n | 2199446 | 17.4% |
| e | 1087275 | 8.6% |
| a | 1050701 | 8.3% |
| d | 827512 | 6.6% |
| i | 777182 | 6.2% |
| r | 721186 | 5.7% |
| u | 664153 | 5.3% |
| I | 664146 | 5.3% |
| j | 664146 | 5.3% |
| c | 562386 | 4.5% |
| Other values (21) | 3405178 | 27.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 12623311 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 12623311 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 12623311 | 100.0% |
InjuryType
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 456136 |
| Missing (%) | 31.1% |
| Memory size | 86.3 MiB |
| Value | Count |
|---|---|
| Fatal | 349090 |
| Minor | 315373 |
| Not Applicable | 221248 |
| Grievous | 122253 |
| Abused | 1683 |
Length
| Max length | 14 |
|---|---|
| Median length | 5 |
| Mean length | 7.3371277 |
| Min length | 5 |
Characters and Unicode
| Total characters | 7407909 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fatal |
|---|---|
| 2nd row | Fatal |
| 3rd row | Fatal |
| 4th row | Fatal |
| 5th row | Fatal |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Fatal | 349090 | 23.8% |
| Minor | 315373 | 21.5% |
| Not Applicable | 221248 | 15.1% |
| Grievous | 122253 | 8.3% |
| Abused | 1683 | 0.1% |
| (Missing) | 456136 | 31.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| fatal | 349090 | 28.4% |
| minor | 315373 | 25.6% |
| not | 221248 | 18.0% |
| applicable | 221248 | 18.0% |
| grievous | 122253 | 9.9% |
| abused | 1683 | 0.1% |
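Grievous appears as 8.3% under Common Values but 9.9% in the word table. The two figures are consistent once the denominators are taken into account: the first divides by all observations (missing rows included), the second by the total number of words. The short check below reproduces both from the counts printed in this report; the variable names are illustrative only.

```python
# Counts copied from the InjuryType tables in this report.
n_rows = 1465783  # number of observations, missing values included
word_totals = {
    "fatal": 349090,
    "minor": 315373,
    "not": 221248,
    "applicable": 221248,
    "grievous": 122253,
    "abused": 1683,
}
grievous = word_totals["grievous"]

# Common Values: share of all rows.
share_of_rows = 100 * grievous / n_rows
# Word table: share of all words ("Not Applicable" contributes two words per row).
share_of_words = 100 * grievous / sum(word_totals.values())

print(f"{share_of_rows:.1f}% of rows, {share_of_words:.1f}% of words")
# prints "8.3% of rows, 9.9% of words"
```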
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 919428 | 12.4% |
| l | 791586 | 10.7% |
| i | 658874 | 8.9% |
| o | 658874 | 8.9% |
| t | 570338 | 7.7% |
| p | 442496 | 6.0% |
| r | 437626 | 5.9% |
| F | 349090 | 4.7% |
| e | 345184 | 4.7% |
| M | 315373 | 4.3% |
| Other values (11) | 1919040 | 25.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7407909 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7407909 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7407909 | 100.0% |
Injury_Nature
Text
MISSING 
| Distinct | 1291 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 1438422 |
| Missing (%) | 98.1% |
| Memory size | 45.6 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 49 |
| Mean length | 7.8716056 |
| Min length | 1 |
Characters and Unicode
| Total characters | 215375 |
|---|---|
| Distinct characters | 101 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 785 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | No Injury |
|---|---|
| 2nd row | grievous |
| 3rd row | grievous |
| 4th row | Miner |
| 5th row | Grievous |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| minor | 7792 | 22.2% |
| simple | 4278 | 12.2% |
| grievous | 3724 | 10.6% |
| injury | 2532 | 7.2% |
| grevious | 2383 | 6.8% |
| fatal | 1464 | 4.2% |
| nature | 1244 | 3.6% |
| in | 1160 | 3.3% |
| head | 800 | 2.3% |
| not | 511 | 1.5% |
| Other values (773) | 9145 | 26.1% |
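The word counts show the same label entered under several spellings: `grievous` (3724) next to `grevious` (2383), and the sample rows include both "grievous" and "Grievous" as well as "Miner". A cleaning pass could fold these together before any analysis. The sketch below is one possible approach; the `CANONICAL` mapping is hypothetical, and treating "Miner" as a misspelling of "minor" is an assumption that should be checked against the raw records.

```python
# Hypothetical spelling map; extend it only after inspecting the raw values.
CANONICAL = {
    "grevious": "grievous",
    "miner": "minor",  # assumed misspelling, not confirmed by the report
}


def normalize_injury_nature(value):
    """Lower-case, trim, and replace known misspellings word by word.

    Missing values (None) are passed through unchanged.
    """
    if value is None:
        return None
    words = value.strip().lower().split()
    return " ".join(CANONICAL.get(word, word) for word in words)


# The five sample rows shown for this variable.
samples = ["No Injury", "grievous", "grievous", "Miner", "Grievous"]
cleaned = [normalize_injury_nature(s) for s in samples]
print(cleaned)
```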
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 15628 | 7.3% |
| I | 13807 | 6.4% |
| r | 12174 | 5.7% |
| e | 10552 | 4.9% |
| o | 10210 | 4.7% |
| R | 9892 | 4.6% |
| M | 9393 | 4.4% |
| G | 8881 | 4.1% |
| S | 8837 | 4.1% |
| n | 8416 | 3.9% |
| Other values (91) | 107585 | 50.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 215375 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 215375 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 215375 | 100.0% |
First rows
| | District_Name | UnitName | Year | Month | age | Caste | Profession | Sex | PresentAddress | PresentCity | PresentState | PermanentAddress | PermanentCity | PermanentState | Nationality_Name | PersonType | InjuryType | Injury_Nature |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Bagalkot | Amengad PS | 2016 | 1 | 14 | Lingayath | Farmer | FEMALE | HUVINAHALLI,TQ-HUANGUND | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Deceased | Fatal | NaN |
| 1 | Bagalkot | Amengad PS | 2016 | 1 | 49 | VOKKALIGA | Farmer | MALE | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,GOKAK | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Injured | Fatal | NaN |
| 2 | Bagalkot | Amengad PS | 2016 | 1 | 0 | VOKKALIGA | Farmer | MALE | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,GOKAK | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Injured | Fatal | NaN |
| 3 | Bagalkot | Amengad PS | 2016 | 1 | 34 | VOKKALIGA | Farmer | MALE | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,TQ-GOKAK | Belagavi Dist | Karnataka | , | Belagavi Dist | Karnataka | India | Injured | Fatal | NaN |
| 4 | Bagalkot | Amengad PS | 2016 | 1 | 36 | VOKKALIGA | Farmer | MALE | BASAVA NAGAR GOKAK CTS 190/5 PLAT NO 2,TQ-GOKAK | Belagavi Dist | Karnataka | , | Belagavi Dist | Karnataka | India | Injured | Fatal | NaN |
| 5 | Bagalkot | Amengad PS | 2016 | 1 | 60 | GANIGA | Housewife | FEMALE | AMBLIKOPPA,TQ-HUNGUND | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Deceased | Fatal | NaN |
| 6 | Bagalkot | Amengad PS | 2016 | 1 | 40 | MUSLIM | Driver | MALE | BELAGAVI,TQ-BELAGAVI | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Injured | Fatal | NaN |
| 7 | Bagalkot | Amengad PS | 2016 | 1 | 20 | Lingayath | Labourer | MALE | HIREBADAWADAGI,TQ-HUNAGUND | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Injured | Fatal | NaN |
| 8 | Bagalkot | Amengad PS | 2016 | 1 | 18 | Lingayath | Farmer | MALE | HIREBADAWADAGI,TQ-HUNAGUND | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Injured | Fatal | NaN |
| 9 | Bagalkot | Amengad PS | 2016 | 1 | 55 | VOKKALIGA | Farmer | MALE | GUDUR SC,TQ-HUNAGUND | Bagalkot | Karnataka | , | Bagalkot | Karnataka | India | Deceased | Fatal | NaN |
Last rows
| | District_Name | UnitName | Year | Month | age | Caste | Profession | Sex | PresentAddress | PresentCity | PresentState | PermanentAddress | PermanentCity | PermanentState | Nationality_Name | PersonType | InjuryType | Injury_Nature |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1465773 | Yadgir | Yadgiri Women PS | 2023 | 11 | 15 | BEDARU | Student | FEMALE | R/o Thanagundi,tq dist yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Kidnapped | NaN | NaN |
| 1465774 | Yadgir | Yadgiri Women PS | 2023 | 11 | 31 | MADIGA | Nurse | FEMALE | R/o Naykal,Now At Mata Manikeshwari Nagara Yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | complainnant | Abused | NaN |
| 1465775 | Yadgir | Yadgiri Women PS | 2023 | 11 | 16 | BEGADI | Student | FEMALE | R/o M Hosalli,tq dist yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Kidnapped | NaN | NaN |
| 1465776 | Yadgir | Yadgiri Women PS | 2023 | 12 | 19 | MUSLIM | Student | FEMALE | R/o Sagar B,Tq Shahapur dist Yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Missing | NaN | NaN |
| 1465777 | Yadgir | Yadgiri Women PS | 2024 | 1 | 22 | LAMBANI | Farmer | FEMALE | SAMANAPURA SANNA THANDA,YADAGIR | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Rape | Not Applicable | NaN |
| 1465778 | Yadgir | Yadgiri Women PS | 2024 | 1 | 19 | CHRISTIAN | Student | FEMALE | R/o Hosalli Cross Near Ratnama School,yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Missing | NaN | NaN |
| 1465779 | Yadgir | Yadgiri Women PS | 2024 | 1 | 16 | HOLAYA, HOLER, HOLEYA | House help - hired | FEMALE | R/o Talak Village,tq dist yadgir | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Kidnapped | NaN | NaN |
| 1465780 | Yadgir | Yadgiri Women PS | 2024 | 2 | 29 | Lingayath | Teacher | FEMALE | R/o Bilahar Village,Tq wadagera dist Yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | complainnant | Abused | NaN |
| 1465781 | Yadgir | Yadgiri Women PS | 2024 | 2 | 29 | REDDY | House help - hired | FEMALE | R/o Thanagundi Village,Now at Mini Vidanasouda yadgiri | Yadgir | Karnataka | , | Yadgir | Karnataka | India | complainnant | Abused | NaN |
| 1465782 | Yadgir | Yadgiri Women PS | 2024 | 2 | 17 | KABBALIGA | Student | FEMALE | R/o Bandalli,TQ DIST YADGIR | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Kidnapped | NaN | NaN |
Most frequently occurring
| | District_Name | UnitName | Year | Month | age | Caste | Profession | Sex | PresentAddress | PresentCity | PresentState | PermanentAddress | PermanentCity | PermanentState | Nationality_Name | PersonType | InjuryType | Injury_Nature | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1731 | Bengaluru City | Cyber Crime Police Station | 2019 | 7 | 0 | VOKKALIGA | Farmer | MALE | , | Bengaluru City | Karnataka | , | Bengaluru City | Karnataka | India | complainnant | NaN | NaN | 39 |
| 11289 | Mangaluru City | Moodabidre PS | 2021 | 6 | 36 | KURUB | Police officer | MALE | POLICE STATION,MOODABIDRE | Mangaluru City | Karnataka | , | Mangaluru City | Karnataka | India | complainnant | Not Applicable | NaN | 26 |
| 15664 | Yadgir | Kembhavi PS | 2024 | 2 | 0 | VOKKALIGA | Labourer | FEMALE | R/o:Kalladevanahalli,Tq:Hunasagi | Yadgir | Karnataka | , | Yadgir | Karnataka | India | Injured | Minor | NaN | 24 |
| 8252 | Hassan | Pension Mohalla PS | 2021 | 5 | 59 | NAYAK | Police officer | MALE | PSI PMPS,HASSAN | Hassan | Karnataka | , | Hassan | Karnataka | India | complainnant | Not Applicable | NaN | 20 |
| 9131 | Kalaburagi | Afzalpur PS | 2018 | 3 | 0 | VOKKALIGA | Police officer | MALE | Afzalpur Police Station,TQ: Afzalpur | Kalaburagi | Karnataka | , | Kalaburagi | Karnataka | India | complainnant | Fatal | NaN | 20 |
| 9134 | Kalaburagi | Afzalpur PS | 2018 | 4 | 0 | VOKKALIGA | Police officer | MALE | Afzalpur Police Station,TQ: Afzalpur | Kalaburagi | Karnataka | , | Kalaburagi | Karnataka | India | complainnant | Fatal | NaN | 20 |
| 4033 | Bengaluru Dist | Nelamangala Traffic PS | 2017 | 8 | 0 | VOKKALIGA | Farmer | MALE | , | Bengaluru Dist | Karnataka | , | Bengaluru Dist | Karnataka | India | Injured | Minor | NaN | 19 |
| 1724 | Bengaluru City | Cyber Crime Police Station | 2018 | 12 | 0 | VOKKALIGA | Farmer | MALE | , | Bengaluru City | Karnataka | , | Bengaluru City | Karnataka | India | complainnant | NaN | NaN | 18 |
| 9103 | Kalaburagi | Afzalpur PS | 2017 | 6 | 0 | VOKKALIGA | Police officer | MALE | Afzalpur Police Station,TQ: Afzalpur | Kalaburagi | Karnataka | , | Kalaburagi | Karnataka | India | complainnant | Fatal | NaN | 18 |
| 3061 | Bengaluru City | Sampangiramanagar PS | 2021 | 3 | 30 | LAD | Self Employed Others | FEMALE | NOTKNOW,NOTKNOW | Bengaluru City | Karnataka | , | Bengaluru City | Karnataka | India | Others | Not Applicable | NaN | 17 |
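The "# duplicates" column in the table above can be read as the number of times an identical record occurs across all 18 variables. As a rough illustration of how such a ranking could be rebuilt, the sketch below counts exact repeats in a small list of hypothetical three-column records using only the standard library; `duplicate_groups` is an illustrative helper, not part of the profiling tool. On the full dataset, pandas' `DataFrame.duplicated` would be the more natural choice.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return (row, count) pairs for rows seen more than once,
    ordered from most to least frequent."""
    counts = Counter(tuple(row) for row in rows)
    repeated = [(row, n) for row, n in counts.items() if n > 1]
    return sorted(repeated, key=lambda pair: pair[1], reverse=True)


# Hypothetical records; None stands in for a missing value so that
# identical rows compare and hash as equal.
sample_rows = [
    ("Yadgir", 2024, None),
    ("Hassan", 2021, "Not Applicable"),
    ("Yadgir", 2024, None),
    ("Yadgir", 2024, None),
    ("Hassan", 2021, "Not Applicable"),
    ("Bagalkot", 2016, "Fatal"),
]
groups = duplicate_groups(sample_rows)
for row, n in groups:
    print(n, row)
```

Many of the most repeated rows in the report have `age` 0 and a near-empty address, which suggests placeholder entries rather than genuinely distinct people; that is worth confirming before dropping them.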